Study of MPEG-7 Sound Classification and Retrieval
نویسندگان
چکیده
In this paper, we present a comparison of three audio taxonomy methods for MPEG-7 sound classification. The MPEG-7 sound classification and indexing tools consist of both low-level and high-level description schemes. For the low-level descriptors that we use, low-dimensional features based on spectral basis descriptors are produced in three stages: normalized audio spectrum envelope, principal component analysis, and independent component analysis. High-level description schemes are used thereafter to describe the modeling of audio features, the procedure of audio classification, and retrieval. For classification we test three approaches: the direct approach, the hierarchical approach without hints, and the hierarchical approach with hints. Our experimental results show that the best approach is the hierarchical approach with hints, which results in a classification accuracy of around 99%. The direct approach produces the second best results, and the hierarchical approach without hints the third best results.
منابع مشابه
General sound classification and similarity in MPEG-7
We introduce a system for generalised sound classification and similarity using a machine-learning framework. Applications of the system include automatic classification of environmental sounds, musical instruments, music genre and human speakers. In addition to classification, the system may also be used for computing similarity metrics between a target sound and other sounds in a database. We...
متن کاملSound Effects Taxonomy Management in Production Environments
Categories or classification schemes offer ways of navigating and higher control over the search and retrieval of audio content. The MPEG-7 standard provides description mechanisms and ontology management tools for multimedia documents. We have implemented a classification scheme for sound effects management inspired on the MPEG-7 standard on top of an existing lexical network, WordNet. WordNet...
متن کاملPerformance of MPEG-7 spectral basis representations for retrieval of home video abstract
In this paper, we present a classification and retrieval technique targeted for retrieval of home video abstract using dimension-reduced, decorrelated spectral features of audio content. The feature extraction based on MPEG-7 descriptors consists of three main stages: Normalized Audio Spectrum Envelope (NASE), basis decomposition algorithm and basis projection, obtained by multiplying the NASE ...
متن کاملParameter-Based Categorization for Musical Instrument Retrieval
In the continuing goal of codifying the classification of musical sounds and extracting rules for data mining, we present the following methodology of categorization, based on numerical parameters. The motivation for this paper is based upon the fallibility of Hornbostel and Sachs generic classification scheme, used in Music Information Retrieval for instruments. In eliminating the redundancy a...
متن کاملDefect Image Classification and Retrieval with MPEG-7 Descriptors
In this paper the visual content descriptors defined by the MPEG-7 standard are applied to defect image classification and retrieval. A pre-classified defect image database is used in evaluation. The experiments are done with a KNN classifier and with a PicSOM content-based image retrieval system. Results indicate that the MPEG-7 features work with a high level of success, especially the Color ...
متن کامل